Separation Structure Speech Separation Model Fusing Conformer and NBC
Abstract
The quality of speech separation affects the entire technology ecosystem. Aiming at the problems of low utilization of local feature information, insufficient convergence speed, too many calculation parameters, and long calculation time in blind source separation with the Transformer dual-path cyclic neural network, a model based on the fusion of Conformer and NBC (Narrow-band NBC) is proposed. First, the information block is replaced by a recurrent network, which can improve speed and the use of features. Secondly, the inter-block loop is replaced, which simplifies vector similarity aggregation and reduces much of the cost and the computational complexity of the model. In experiments on the WSJ0-2mix [7] and WHAM [13] datasets, the model is faster and performs better than other models.
Similar resources
Evaluating Speech Separation Systems
Common evaluation standards are critical to making progress in any field, but they can also distort research by shifting all the attention to a limited subset of the problem. Here, we consider the problem of evaluating algorithms for speech separation and acoustic scene analysis, noting some weaknesses of existing measures, and making some suggestions for future evaluations. We take the positio...
Monaural Speech Separation
Monaural speech separation has been studied in previous systems that incorporate auditory scene analysis principles. A major problem for these systems is their inability to deal with speech in the high-frequency range. Psychoacoustic evidence suggests that different perceptual mechanisms are involved in handling resolved and unresolved harmonics. Motivated by this, we propose a model for monaura...
Supervised Speech Separation and Processing
In real-world environments, speech often occurs simultaneously with acoustic interference, such as background noise or reverberation. The interference usually leads to adverse effects on speech perception, and results in performance degradation in many speech applications, including automatic speech recognition and speaker identification. Monaural speech separation and processing aim to separat...
The 2nd 'CHiME' Speech Separation and Recognition Challenge: Approaches on Single-Channel Source Separation and Model-Driven Speech Enhancement
In this paper, we address the small vocabulary track (track 1) described in the CHiME 2 challenge, dedicated to recognizing utterances of a target speaker with small head movements. The utterances are recorded in reverberant room acoustics corrupted with highly non-stationary noise sources. Such an adverse noise scenario poses a challenge to state-of-the-art automatic speech recognition systems. ...
Monaural speech/music source separation using discrete energy separation algorithm
In this paper, we address the problem of monaural source separation of a mixed signal containing speech and music components. We use Discrete Energy Separation Algorithm (DESA) to estimate frequency-modulating (FM) signal energy. The FM signal energy is used to design a time-varying filter in the time–frequency domain for rejecting the interfering signal. The FM signal energy was chosen due to ...
Journal
Journal title: Journal of physics
Year: 2022
ISSN: ['0022-3700', '1747-3721', '0368-3508', '1747-3713']
DOI: https://doi.org/10.1088/1742-6596/2384/1/012034